Picture for Jing Shi

Jing Shi

YoChameleon: Personalized Vision and Language Generation

Add code
Apr 29, 2025
Viaarxiv icon

Accelerating Multi-Objective Collaborative Optimization of Doped Thermoelectric Materials via Artificial Intelligence

Add code
Apr 11, 2025
Viaarxiv icon

Visual Persona: Foundation Model for Full-Body Human Customization

Add code
Mar 19, 2025
Viaarxiv icon

MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities

Add code
Jan 15, 2025
Viaarxiv icon

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage

Add code
Dec 24, 2024
Viaarxiv icon

GUI Agents: A Survey

Add code
Dec 18, 2024
Viaarxiv icon

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Add code
Dec 13, 2024
Viaarxiv icon

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Add code
Nov 23, 2024
Figure 1 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 2 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 3 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 4 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Viaarxiv icon

GroundingBooth: Grounding Text-to-Image Customization

Add code
Sep 13, 2024
Viaarxiv icon

Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images

Add code
Aug 24, 2024
Figure 1 for Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images
Figure 2 for Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images
Figure 3 for Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images
Figure 4 for Topological GCN for Improving Detection of Hip Landmarks from B-Mode Ultrasound Images
Viaarxiv icon